All in Strings: a Powerful String-based Automatic MT Evaluation Metric with Multiple Granularities
Abstract
String-based metrics for automatic machine translation (MT) evaluation are widely applied in MT research. Meanwhile, some linguistically motivated metrics have been suggested to improve on string-based metrics in sentence-level evaluation. In this work, we change the original calculation units (granularities) of string-based metrics to generate new features. We then propose a powerful string-based automatic MT evaluation metric that combines all the features with various granularities based on SVM rank and regression models. The experimental results show that i) the new features with various granularities can contribute to the automatic evaluation of translation quality; ii) our proposed string-based metrics with multiple granularities based on the SVM regression model can achieve higher correlations with human assessments than state-of-the-art automatic metrics.
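The idea of recomputing one string-based score at several granularities can be illustrated with a minimal sketch. It is not the authors' feature set: the clipped n-gram F1 score, the two granularities (word and character), and all function names below are assumptions chosen for illustration only.

```python
from collections import Counter


def ngrams(units, n):
    """Return a multiset of n-grams over a sequence of units."""
    return Counter(tuple(units[i:i + n]) for i in range(len(units) - n + 1))


def overlap_f1(hyp_units, ref_units, n):
    """Clipped n-gram F1 between a hypothesis and a reference."""
    hyp, ref = ngrams(hyp_units, n), ngrams(ref_units, n)
    matched = sum((hyp & ref).values())  # multiset intersection clips counts
    if matched == 0:
        return 0.0
    precision = matched / sum(hyp.values())
    recall = matched / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical granularities: each maps a sentence to a sequence of units.
GRANULARITIES = {
    "word": lambda s: s.split(),
    "char": lambda s: list(s.replace(" ", "")),
}


def granularity_features(hypothesis, reference, max_n=2):
    """Build one feature per (granularity, n-gram order) pair."""
    features = {}
    for name, split in GRANULARITIES.items():
        h, r = split(hypothesis), split(reference)
        for n in range(1, max_n + 1):
            features[f"{name}_{n}gram_f1"] = overlap_f1(h, r, n)
    return features
```

A feature dictionary like this, computed per sentence pair, is the kind of input that could then be passed to a regression or ranking learner trained against human assessments; that learning step is left out of the sketch.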
Related resources
Normalized Compression Distance as automatic MT evaluation metric
This paper evaluates a new automatic MT evaluation metric, Normalized Compression Distance (NCD), which is a general tool for measuring similarities between binary strings. We provide system-level correlations and sentence-level consistencies to human judgements and comparison to other automatic measures with the WMT’08 dataset. The results show that the general NCD metric is at the same level ...
A Fast and Accurate Global Maximum Power Point Tracking Method for Solar Strings under Partial Shading Conditions
This paper presents a model-based approach for the global maximum power point (GMPP) tracking of solar strings under partial shading conditions. In the proposed method, the GMPP voltage is estimated without any need to solve numerically the implicit and nonlinear equations of the photovoltaic (PV) string model. In contrast to the existing methods in which first the locations of all the local pe...
Normalized Compression Distance Based Measures for MetricsMATR 2010
We present the MT-NCD and MT-mNCD machine translation evaluation metrics as submission to the machine translation evaluation shared task (MetricsMATR 2010). The metrics are based on normalized compression distance (NCD), a general information theoretic measure of string similarity, and evaluated against human judgments from the WMT08 shared task. The experiments show that 1) our metric improves...
The Role of Pseudo References in MT Evaluation
Previous studies have shown automatic evaluation metrics to be more reliable when compared against many human translations. However, multiple human references may not always be available. It is common that automatic metrics must make judgments based on a single human reference (extracted from parallel texts) or no reference at all. Our earlier work suggested that a promising way to address this...
Effects of Disc Insulator Type and Corona Ring on Electric Field and Voltage Distribution over 230-kV Insulator String by Numerical Method
Insulator strings with various materials and profiles are very common in overhead transmission lines. However, the electric field and voltage distribution of an insulator string is uneven, which may easily lead to corona, deterioration of the insulators' surface, and even flashover. So the calculation of the electric field and voltage distribution along them is a very important factor in the operation time....